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SPREAD SPECTRUM RECEIVER ARCHITECTURES 
AND METHODS THEREFOR 



FIELD OF THE INVENTIONS 

The present inventions relate generally to spread spectnun receivers, 
and more particularly to programmable spread spectrum receiver architectures 
and methods therefor. 



BACKGROUND OF THE INVENTIONS 

Satellite based positioning system enabled receivers, for example 
Global Positioning System (GPS) receivers, are used v^dely for navigation and 
have substantial potential to provide location information in mobile wireless 
commtmication devices, including cellular telephones, which must soon comply 
with United States Federal Communications Commission E-911 location 
requirements. 

One of the foremost interesting specifications of a GPS receiver is the 
time required to acquire satellite signals, known as spread spectrum 
pseudorandom noise (PN) codes, and to provide position coordinates after power 
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is applied to the receiver. The time required to perform these operations is known 
as the time-to-first-fix (TTFF), which is determined generally by the hardware and 
software architecture of the receiver. 

In battery powered hand-held GPS receivers, including those 
embedded in cellular telephones, the acquisition time inflt^ces total battery life 
since the receiver is powered continuously during location determination. The 
time required to generate a location fix is also important in emergency location 
applications, for example in B-911 enabled cellular telephones. Another important 
consideration, partiailarly in hand-held GPS receivers, is the time to acquire 
signals in weak signal enviroiunents, for example in environments where the 
signal is obstructed by foliage, automobiles, urban canyons and buildings. 

It is known to search the code phase space for a single satellite in 
parallel. US Patent No. 6,009,118 to Tiemanrv for example, discloses 2046 parallel 
correlators that search all phase delays for a single satellite. In "Real Time Missile 
Tracking'', Proceedings of the ION Aerospace Meeting", April 1981, Wells 
describes a flash parallel correlator that computes correlation parameters for as 
many as 64 different phase delays of a single satellite. In these and other known 
parallel correlation schemes, a correlator block is assigned to search for one 
satellite over a pre-determined number of phase delays. In both Tiemann and 
Wells, however, ihe correlator block searched only for one satellite at a time. In 
these and other prior art systems, satellite signal searching remains a sequential 
process, wherein the parallel processing applies only to the phase delays for the 
particular satellite being searched. In Tiemaim and WeUs, searching or detecting 
more than one satellite in parallel requires duplication of the parallel correlators. 

Other examples of parallel correlators are described in US Patent No. 
5,901,171 to KohU, and US Patent No. 6,208,291 to Krasner. In these patents a 
parallel correlator circuit is duplicated N times in order to search for N satellites. 

In some applications, the satellite signal receiver has knowledge of 
the visible satellites, their approximate Doppler frequency, and in some cases, the 
approximate phase delay and phase/polarity of the 50 bits per second (BPS) 
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navigation message bits. This knowledge can come from locally stored ephemeris, 
almanac, approximate position, and time, or from other sources, for example, the 
IS-801 specification provides satellite visibility, Doppler, phase delay at a 
particular epoch time. Since ttiere are generally 8-10 satellites visible at any one 
time, searching in parallel for these signals shortens tihe total acquisition time. 

What is desired generally is an efficient spread spectrum signal 
searcher that substantially reduces tiKe average TTFF in a manner that minimizes 
the number of gates/ transistors and in some applications reduces power 
consiunption. 

The various aspects, features and advantages of the present 
invention will become more fully apparent to those having ordinary skill in the art 
upon careful consideration of Ihe following Detailed Description of the Invention 
with the accompanying drawings described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 is block diagram of an exemplary spread spectrum receiver. 

FIG. 2 illustrates GPS signal search space code phase and Doppler 
dimensions and a region thereof. 

FIG. 3 illustrates several exemplary search modes. 

FIG. 4 is a functional block diagram of an exemplary receiver, 

FIG. 5 is a block diagram of I & Q signal segment registers. 

FIG. 6a is an exemplary Doppler generator block diagram. 

FIG. 6b is an exemplary code phase format maintained by a code 
phase computer. 

FIG. 7 is an exemplary PN code generator block diagram. 

FIG. 8 is a detailed portion of the PN code generator of FIG. 7. 

FIG. 9a is an exemplary maihematical functional representation of 
the correlator. 
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FIG. 9b is an exemplary truth table logic. 
FIG. 9c is an exemplary correlator circuit. 

FIG. 10a is an exemplary schematic processing flow diagram with 
half-chip delay correlation increments. 

FIG. 10b is another exemplary schematic processing flow diagram 
with onekrhip delay correlation increments. 

FIG. 11 is an exemplary coherent integration modtde block diagram. 

FIG. 12 is an exemplary non-coherent integration module block 



diagram, 
diagram. 



FIG. 13 is an exemplary peak detector and output register block 



FIG. 14 illustrates offset code phases. 
FIG. 15 is a coherent integration diagram. 

FIG. 16. is a non-coherent signal magnitude versus integration offset 
time diagram. 



DETAILED DESCRIPTION OF THE INVENTIONS 

FIG. 1 is a block diagram of an exemplary spread spectrum receiver 
comprising generally an input signal integration and preprocessing (BIP) module 
10, a segmentation and register block 20, a flash correlator block 30, a PN code 
generator block 40, a Doppler generator block 50, a code phase computer block 60, 
a coherent integration block 70, a non-coherent integration block 80, a peak 
detector block 90, a real time clock 92, and a track processing block 94, a processor 
interface block 96 and a timing generator block 98. In a preferred embodiment of 
the present invention, a single hardware correlator block is time shared to search 
for N satellites in parallel. In other embodiments, the correlator block may be 
duplicated to provide greater parallelization. 
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The receiver may be operated and controlled by either a resident 
digital signal processor (DSP) or by some other processor, not illustrated, for 
example a handset call processor in applications where the receiver is integrated in 
a cellular handset. In one embodiment, the receiver is controlled as an on-chip 
peripheral via an address-mapped parallel bus. 

In its primary search for signals mode, in one embodiment, the 
receiver interrupts the controlling processor at a rate not exceeding 10 Hz, In a 
tracking mode, for example in GPS location applications, the receiver is 
programmable to interrupt at rates between 0 and 4095 in 1 ms steps, thus 
allowing processor software to control the code and frequency tracking loops. 
Alternatively, the receiver includes dedicated hardware for controlling the code 
and frequency tracking loops. 

To perform a sky search for a particular GPS satellite, all possible 
phase delays and all possible Doppler shifts must generally be searched. FIG. 2 
illustrates a two-dimensional total uncertainty space for a single GPS satellite SVl 
and a smaller uncertainty region thereof determined, for example, with assistance 
data, e.g., satellite almanac, ephemeris, approximate position, and approximate 
time. The imcertainty space and region thereof both comprise a pseudorandom 
noise code phase dimension (Code Phase Space) and a Doppler frequency 
dimension (Doppler Space). Since the Code Phase Space can be described as 1023 
possible PN code chips (N = 2046 search bins for half-chip spacing search tests) 
and the Doppler space can be described as +/- 5,000 Hz of Doppler uncertainty (M 
= 10) search bins of Doppler at a pre-detection-integration (PDI) of 1 millisecond, 
the total number of search bins is given by N x M = 20,460 search bins. In one 
embodiment of the present invention, all of these bins can be searched with a 
single correlator, wherein each search bin is searched sequentially, one at a time. 
The term PDI is also referred to as the coherent integration time of the integration, 
the terms are used interchangeably. 

At 1 millisecond per dwell, using a single correlator, 20.46 seconds 
are required to search all possible bins for a single satellite. The search time 
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increases very quickly as the PDI is increased because the dwell time lengthens 
and the number of frequency bins increases. More generally, there is also 
imcertainty in local time and in the receiver reference oscillator with the result that 
the number of search bins may be substantially greater. 

In FIG. 2, the imcertainty space may be narrowed substantially to a 
smaller imcertainty region with assistance data. See U.S. Patent No. 6,121,923 
entitled "Fixed Site And Satellite Data-Aided GPS Signal Acquisition Method And 
System", which is assigned commonly with the present application. In one mode 
of GPS operation, for example, the receiver uses traditional aiding information, 
including almanac, ephemeris, approximate position data, etc. In wireless 
communications applications, the receiver may use aiding data prescribed in a 
wireless assisted over the air standards specification^ for example SVTD, Doppler, 
code phase estimates, etc. The receiver also operates autonomously, without 
assistance data. In FIG. 2, the code phase dimension of the smaller uncertainty 
region includes N search space bins separated from an adjacent code phase bin by 
a BIN_LENGTH quantity, and the Doppler frequency dimension includes M 
search space bins separated from an adjacent bin by a deltaDop quantity. 

The receiver is programmable to search code phase and Doppler 
space for one or more satellites in parallel, and more particularly the receiver is 
configurable dynamically and optimally to search any number of satellites over 
any number of code phase and Doppler bins. 

The receiver employs pseudo or virtual paraUelization through a 
batch parallel/serial architecture running faster than real time to search a specified 
imcertainty space or narrower region thereof including one or more phase delays 
and one or more Doppler frequencies. The exemplary searcher has the capability 
to search 4096 code phase-frequency-satellite search bins in real-time, using a 128 
state flash parallel correlator that is time-multiplexed to cover the total uncertainty 
space or a lesser region thereof. In one embodiment, for example, one or more of 
the plxurality of spread spectrum signals is correlated virtually in parallel over a 
corresponding predetermined Doppler search range having at least one Doppler 
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search bin for a predetermined number of phase delays by sequentially correlating 
at a rate greater than a rate at which the spread spectrum signal is received. In 
another embodiment, a plurality of spread spectrum signals are correlated 
virtually in parallel over a predetermined nimiber of phase delays by sequentially 
correlating ihe plurality of spread spectrum signals witii corresponding replica 
signals at a rate greater than a rate at which the spread spectrum signals are 
received. 

In GPS applications, the 1-millisecond PN code length consisting of 
2046 half-chip delays, are segmented into 16 segments having unequal segment 
lengths by the receiver. The first 15 segments are 128 samples long, and the last 
segment is 126 samples long. Other unequal segmentations may be used also, 
although preferably each segment has a sufficient number of samples to permit 
the desired number of search bins to be tested during the period of time it takes 
to collect the next segment 

FIG. 3 illustrates several exemplary search modes of the spread 
spectrtun signal receiver architecture. In a first mode, 2 satellites are searched 
simultaneously over the entire code phase imcertainty space (2046 half chip test 
bills) each at one Doppler bin or frequency. In a second mode, 1 satellite is 
searched simultaneously over the entire code phase imcertainty space at 2 Doppler 
frequencies. In a third mode, 2 satellites are searched simultaneously over a 
portion of the code phase xmcertainty space (512 half chip test bins) each at 4 
Doppler frequencies. In a fourth mode, 2 satellites are searched simultaneously 
over a portion of the code phase uncertainty space (256 half chip test bins) each at 
8 Doppler frequencies. In a fifth mode, 1 satellite is searched simultaneously over 
128 half-chips of code phase uncertainty space (2046 half chip test bins) at 32 
Doppler frequencies. Other exemplary modes are also illustrated. The last mode 
X of FIG. 3 shows more clearly the degree of search mode flexibility. A total of 11 
satellites are searched simultaneously and in parallel, each over different numbers 
of Doppler bins and each over a different number of half-chip delays, which are 
non-powers of two. The search modes of HG. 3 are exemplary only and are not 
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intended to limit the invention. The exemplary receiver can search and locate up 
to twelve satellite signals in parallel, each satellite searched over miiltiple Doppler 
bins and over an integer number of half-chips of code space uncertainty. The 
search over code phase and Doppler space for each satellite is controlled 
independently. The only restriction is that the number of total search bins cannot 
exceed the capacity of the searcher (4096 for the exemplary embodiment). Also, 
the receiver may search less than 4096 bins, in which case the correlator clock rate 
may be reduced. 

In one mode of GPS operation, a Doppler search range is assigned to 
each of a plurality of the spread spectrum signals received, and one or more 
Doppler search signals are generated for each of the assigned Doppler search 
ranges. Where the Doppler search range includes more than one Doppler signal, 
there is a Doppler step size increment between each of the Doppler search signals 
witihin a particular Doppler search range. In one embodiment, the Doppler search 
signals generated for at least two of the plurality of Doppler search ranges is 
different, and the number of Doppler search signals generated is limited to some 
predetermined number. 

The receiver may also operate in a tracking mode (once Doppler and 
code phase have been found for each visible satellite), in which code and carrier 
tracking loops are enabled in order to continuously update the pseudorange and 
pseudorange rate measurements for continuous position updates. In the tracking 
mode, the clock rate for most of the correlator array can be tuned down to a lesser 
frequency, about 5 MHz in the exemplary embodiment, since only a minimum 
number of code-Doppler bins are searched to maintain track. 

The search space may also be compressed into l-<:hip samples, 
allowing for a doubling of the code phase search space. Each satellite that is 
searched has a software selectable option that allows for 1 chip or chip spacing. 
In the limit, for tiie exemplary embodiment, this option allows for full code-phase 
searching up to FOUR satellites simultaneously. 
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In FIG. 4, the function of the ISIP block of FIG. 1, includes an RF 
processing block 300, A/D conversion 310, and translation and scaling of sampled 
signal data from A/D converter into two-bit I & Q data samples at an effective 
rate, for example 1.023, 2.046, 4.092 or 8.184 MHz, depending on a sample rate 
parameter programmed into the controlling processor. In some embodiments a 
decimator & resampler 312 may be employed to translate samples to the desired 
sample rate. 

The two-bit I & Q data samples are scaled into a sign-magnitude 
format (not two's complement) and each encodes four possible states of signal 
amplitude into the four possible states of the two bits. For example, state "00" 
represents a signal magnitude of +1, state "01" represents a signal magnitude of 
+3, state 10 represents a signal magnitude of -1, and state 11 represents a signal 
magnitude of -3. Any other ordering of state to signal magnitude is possible, 
which would deviate from a strict sign-magnitude format, as long as the flash 
correlator accommodates the format and uses the mapped signal magnitudes to 
form the correlation products. 

In FIG. 4, sampled I & Q signal components are segmented into 
signal segments by a divider block 314. The signal segments are then stored into 
corresponding I & Q Rl registers 316 (only one of which is identified with a 
reference numeral). In the exemplary embodiment, the divider block 
pivl28/126) 314 imequally divides a 1ms long segment of a 2046 half chip PN 
code bits into fifteen signal segments of 128 samples long and into one signal 
segment 126 samples long, and then repeats. Preferably, a last signal sample of 
the one signal segment is separated from a first signal sample of an adjacent 
signal segment by a single sample. Each completed segment is clocked into the 
Rl register 316 at a rate of approximately 16 KHz, or sbcteen times for each PN 
code repeat cycle. The divider block Divl28/126 also outputs a segment length 
signal, for example a 1 or 0, to subsequent processing elements to indicate 
whetiier to process 128 samples or 126 samples of the Rl register. 
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FIG. 5 is a more detailed illustration of the I & Q Rl registers 316 of 
FIG. 4, which store the two-bit I & Q signal segments. Particularly, the register 
block comprises I & Q serial to parallel registers 510 and 520, and a GPSQneKHz 
clock generator 530. In the exemplary embodiment, the tmequal segments are 
clocked into the I & Q registers 510 and 520 sequentiaDy at a rate of approximately 
16 KHz. A signal (NewRl) is generated to indicate completion of a NewRl data, a 
signal (Seg_Ntim) is generated to indicate the segment number of the NewRl, 
GPSOneKHz clock, and segmented samples of I & Q. Other embodiments may 
include parallel arrays of I & Q registers. 

In the exemplary architecture, the clock signals generating I & Q 
signal segments and the clock loading the Rl registers are the only clocks in the 
receiver that require receiver quality stability, i.e., low phase noise clocks having 
an Allen variance not greater than 3 * 10-^. The dock circuit that clocks the 
remaining receiver portions can be any clock operating above some minimum 
clock rate, which in the exemplary 4096-search bin architecture is not less than 
about 65 MHz (64 * 1.023). The architecture is thus compatible with baseband 
architectures in which the DSP or other processor runs at some rate between 70 
and 105 MHz. 

In FIG. 4, the output of a Doppler NCO 318 is sampled and 
parallelized to two R2 registers 322 (only one of which is illtistrated) in a marmer 
similar to the input signal segmentation sequence. The number of samples in the 
R2 register is either 128 samples or 126 samples, depending on the segment length 
output of the Divl28/126 divider block 314. The R2 registers are coupled to a 
multiplier block 326 by a multiplexer 328. This multiplexer arrangement allows 
on parallelized R2 Doppler wipeoff waveform to remain constant for a number of 
clocks and be fed into the flash correlator, while another R2 Doppler wipeoff 
waveform is prepared by the NCO. In the exemplary embodiment, llie Doppler 
Generator has a limited number of clock cycles available to generate the 128 or 
126 Doppler samples loaded into the R2 register. Doppler signal generator 
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designs suitable for tise with the exemplary architecture include using four 
traditional NCO's in parallel, or a SIN/ COS state-transition computer. 

The exemplary embodiment has a 32-clock cycle limit in order to 
load the R2 register for the next correlation. In FIG. 6a, four NCO's 602, 604, 606 
and 608 create the Doppler pattem for the register 610. If not for the 32 clock cyde 
limit, then a single 24 bit NCO that generates a COS and SIN output (one bit each) 
would be sufficient to create the 128 1 & Q samples. The 1-bit SIN output of NCO 
is the MSB of the integrator (bit 23 of a 24 bit NCO), while the COS output is the 
EXOR of the MSB with the next lower bit of the integrator (bits 23 and 22). The 
two output bits of the NCOs are delivered to corresponding 128 state serial to 
parallel shift registers 603, 305, 607 and 609 when the 128 samples are created, and 
then the R2 register 610 captures all 128 states simultaneously. 

In the exemplary embodiment having a 32-clock cycle limit, 
parallelization reduces the nxmiber of clocks at each stage. The four NCO's in FIG. 
6a each have the ability to load the starting phase and frequency at clock zero. The 
1«* NCO creates the Doppler pattem for samples 1 through 31, while the 2^^^ NCO 
generates the pattem for sample numbers 32 through 63, the 3^^^ NCO creates data 
for samples 64 through 95, and the 4*^^ NCO for samples 96 through 128. Since the 
NCOs are docked simultaneously, it takes only 32 docks to create all 128 samples. 
The starting phase of the 2^^^ through 4* NCO are computed as follows: 2"^ NCO 
Starting Phase = l^t NCO Starting Phase + 32 * Fw; 3rd NCO Starting Phase = l^t 
NCO Starting Phase + 64 * Fw; and 4th NCO Starting Phase = l^t NCO Starting 
Phase + 96 * Fw. The final phase stored in 4^^ NCO after the 32"^ system clock is 
written back into a Phase RAM 612 for use during the next Rl register sample. For 
the signal segment having only 126 samples, the 4*^ NCO is short cycled only 30 
dock cydes. 

The phase stored in RAM 612 is stored in 32 bit wide words. The 
lower 24 bits represent fractional phase of one carrier cycle, while the upper 8 bits 
represent integer carrier cycles accumulated. The word size of the RAM 612 is 64 
words. The frequency RAM 611 also contains 64 entries but stores only the 24 bits 
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representing a frequency word of a desired satellite/ Doppler bin. The 64 words 
contained in the frequency RAM 611 and phase RAM 612 can be allocated to one 
or more satellites, each word corresponding to one Doppler bin and one satellite. 
One NCO structure is time-shared to create Doppler signals stored in the R2 
register to represent up to 64 possible Doppler signals. For example, 64 Doppler 
bins can be allocated to one satellite, consxmiing all 64 data words. Alternately, 32 
Doppler bins can be allocated for two satellites, again consuming all 64 data 
words. Any combination of Doppler bins per satellite can be allocated, as shown 
in FIG. 3, as long as the total number does not exceed the 64 memory locations of 
RAM 612 and 611. 

A phase difference circuit 613 shown on FIG. 6a is used to compute a 
PN phase delay correction to the PN code generator circuit. The phase difference 
need not be a 32 bit difference, rather, a 10 bit difference of the upper 10 bits of the 
32 bit long phase word is sxifficient, and reports a phase difference in integer and 
fractional cycles down to cycle resolution. The phase difference circuit 
computes a difference in phase that occurs for each Doppler signal generated (both 
integer and fractional cycles to V4 cycle resolution) and passes the difference to the 
code phase computer block (60 on FIG. 1), and ultimately to the PN Generator 
block (40 on FIG. 1) so as to keep the code phase stationary in one code phase bin 
during an integration time. In flus way, the R3 PN code signals are time-shifted to 
compensate for Doppler induced code phase shifts of the received signals and to 
keep the correlation sum stationary in one accumulation bin. It is well known that 
GPS signals are coherent between Doppler and code phase, so that the 
accxmxulated shift in range due to Doppler cycles can be used to compensate for 
code phase delays. For example, the GPS signal carrier frequency of 1575.42 MHz 
and the PN code chipping frequency of 1.023 MHz are coherent, that is, the same 
oscillator generating the carrier frequency is used to generate the PN code clock 
internally to each satellite. As such, each PN code clock represents 1575.42 MHz / 
1.023 MHz, or 1540 carrier cycles per PN code clock. By accumulating the Doppler 
carrier cycles (by accumulating the phase difference of the output of 613, it is 
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possible to make continuous code phase adjustments so as to keep the replica PN 
code signal generated by 40 of FIG. 1 coherent with the received signal. Since the 
receiver of this design uses primarily Vi chip spaced samples (stored in Rl, R2, and 
R3 registers), flie number of integer Doppler signal cycles needed to be 
accumulated before making a code phase adjustment of y2 chip is % * 1540, or 770 
carrier cycles. Consequently, by accmnulating the phase difference quantities for 
each Doppler bin/ satellite (generated on FIG. 6a and consumed in the code phase 
computer block 60 of FIG. 1), the code phase loaded into the PN generator block 40 
of FIG. 1 is adjusted by V2 chip for every 770.0 accximulated Doppler cycles. 

Depending on the RF implementation, the code phase computer may 
either add or subtract % chip from each accumulation of 770 cycles. For example, 
if the RF circuits use high-side injection or low side injection (that is, titie local 
oscillator is higher or lower than the desired signal), the received signal may cause 
a decreasing code phsise accumulation or an increasing code phase accxmitilation. 
For this design, the processor can set a parameter called APAD, or auto-phase 
advance direction register, which controls if the accumulated Doppler cycles add 
or subtract from code phase. 

FIG. 6a also illustrates a compensation circuit that accounts for 
reference oscillator offset frequency and ttie rate of frequency change. These 
parameters, OSCL^RATE and OSCL^FREQ, are estimated by the controlling 
microprocessor, and are written by the processor to offset all Doppler 
measurements for all satellites, thus accounting for reference oscillator offset 
frequency. The parameters are clocked into a 32-bit accumulator 614 by an Fm 
clock signal or a measurement epoch clock. The accumulator runs at a clock rate 
s)mchronous with every 16* Rl register load cycle, or once every 1 millisecond. 
The output of the accumulator changes by one addition of the OSCL^RATE input 
parameter for every 16 new Rl register load cycle. The accumulated sxmi of 
OSCL_FREQ and OSCL^RATE * N can be observed by the processor on the 
OSaL.KT register, as it is captured on the next measurement epoch output clock 
which is S3mchronous with the interrupt. Bit alignment is as follows: The 24 bits 
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of OSCL_FREQ are aligned up-shifted by 8 bits into the accumidator. The 10-bit 
OSCL_RATE parameter is bit-aligned with the accumulator. The output 24 bits 
that drive the OSCL^EST register and that feed into the parallel NCO are up- 
shifted by 8 bits, that is, these outputs observe the 31** through 8*** bit of the 
accumulator. The range of the OSCL_RATE parameter is between a maximxmi 
OSCL.RATE = 1000 * (4 * 0.1220 Hz) = 488 Hz per second and a minimum 
OSCL^RATE = 1000 * (0.1220 / 256 Hz) = 0.476 Hz per second. 

The code phase computer block 60 of FIG. 1 performs the following 

functions: 

a) Keeps track of the absolute code phase for each segment in 
memory. The processor will read the code phase directly from the code phase 
computer memory. 

b) Accumulates Doppler induced changes in code phase by counting 
Doppler cycles (auto phase advance in code phase). 

c) Computes the integer number of half chips (pre-positioning data) 
that the PN code generator will use din-ing the next application of the signal, 
including auto-phase advance components. 

d) Compensates the integer code phase half-chip offsets based on the 
segment number in Rl. 

e) Accotmts for processor commanded manual adjustments to the 
code phase in tmits of Doppler carrier cycles via a processor write able parameter 
called the MANL„CP_ADJ input. 

The code phase computer must accumulate the code phase for 
periods when the PN generators need a pre-positioning target code phase for 
each segment. As such, it must compute a starting code phase for each 
BIN^LNGTH segment in the coherent and non-coherent RAM 16 times per 
millisecond, corresponding to the 16 different segments stored in the Rl register 
per millisecond. As is the case in the Doppler generator block 50 of FIG. 1, the 
code phase computer block 60 of FIG. 1 needs working RAM in order to keep up 
to 64 unique code phase registers. 
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For convenience, the following format is proposed for the code phase 
registers, the format is maintained by the code phase computer. Each register is 
subdivided into three segments as shown in FIG. 6b. The code phase stored in 
each register refers to the code phase delay of the l^t y% chip element in each 
memory segment The code phase delay for other elements within each segment is 
simply based on the number of integer % chips away from the 1^* element of the 
segment. 

Fractional chips are accumulated in units of V4 carrier cycles in order 
to be compatible with the Doppler generator change-in-phase output. The 
exemplary Doppler generator computes the total number of Vi carrier cycles 
within each Rl register processed for each correlator segment contained in the 
coherent RAM. As such, the fractional chips portion of the code phase 
acamiulator takes on values of zero through 769,75 cycles, after which it causes a 
carry into the code lengths portion of the accumulator. At GPS L-band signal, for 
which the PN code and carrier are coherent, there are 1540 carrier cycles for each 
PN code chip, or 770 for a chip length. Consequently, in order to maintain 
coherence and accommodate Doppler iaduced code phase delay changes, it is this 
process of accumulating 770 carrier cycles and then carrying forward into the next 
field (half chip delays) that accomplishes the process called auto-phase advance. 

Likewise, the Code Lengths portion of the accumulator takes on 
values between zero and 1022.5 chips in Vi chip delay increments, or integer values 
between 0 and 2045. When the code lengths portion rolls over, the integer code 
lengths portion of the accumulator is incremented one time. The total number of 
bits containing the integer code lengths portion of the accimiulator is set based on 
the maximum time expected to track a satellite (in a tracking mode, 10 hoxirs), and 
the maximum rate of change for such a signal. 

The algorithm to be applied by the code phase computer is best 
described in a C-code equivalent notation The architecture of Ae modxile can be 
represented by an ALU, bit-shifter, and RAM block. The actual architecture is not 
important, as long as it performs the described algorithm. The primary 
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requirement is that the code phase computer complete its update of the code phase 
parameters for all satellites and all Doppler signals needed (unique values of R2) 
and deliver the predicted pre-position code phase to the PN generator within the 
minimum ntunber of system clock cycles, 32 cycles. The algorithm is described in 
an initialization mode and a running mode. 

In the Initialization mode (before an integration dwell period 
begins), the following occurs: 

For each satellite segment in the code phase computer memory, 
Integer_Code_Lengths[i] = 0; 

Code_Lengths[i] = CP_OFFSET[i] (copy of the code phase offset 
register stored in the configuration block); 

FractionaLCode_Phase[i] = N * Delta_Cp; where N = the bin 
nximber for any block of segments having more than one defined NUM3INS, and 
Delta_Cp is a change in code phase for subsequent bins of the same satellite, 
generally in integer carrier cycles and thus can represent a step size offset of the 
PN code of much less than V2 chip. For example, Delta_Cp can be 77 carrier cycles, 
which allows for PN code offset of 77 / 1540 chips, or 0.05 chips. Delta_Cp may 
tihus be used to offset subsequent bins by an amoxmt proportional to 
Delta_Cp/1540 chips. The parameter Delta.Cp is controllable per satellite by the 
controlling processor and can be set to any integer carrier cycle value, thus the 
stepsize in offsets for multiple code phase bins on a particular satellite can be in 
steps as small as 1/1540 chips, or 0.000649 chips. In this way, the delay difference 
from one bin to another can be adjusted to much finer than V2 chip delay. 

In a Run Mode - 

This process generally runs some time after the Doppler generator 
has generated an R2 for the selected satellite/Doppler bin. The process to update 
the code phase after each Rl segment is processed for each satellite for each 
coherent RAM segment is: 



# 
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FractionaLChips = Quarter_Cycle_Count (from Doppler Gen) + MANLuCP^ADJ (also in quarter cycles); 
MANL^CP^ADJ ° 0; (after the last bin is updated for this satellite); 
If(APAD 1) /* ** Code Phase Increases with each 770 Doppler Cycles ***/ 
I 

If (FractionaLChips >= 770.0 cycles) 
{ 

FractionaLChips -= 770.0; 
Code^Lengths += 0.5; 
If(Code_Lengths >» 1023.0) 
{ 

Code^Lengths -= 1023.0; 
Integer_Cod^Lengths += 1; 

} 

} 

eke If (FractionaLChips < 0.0 q'des) 
{ 

FractionaI_Chips += 770.0; 
Code_Lengfts — 0.5; 
If (CodeL.Lengths < 0.0) 
{ 

Cod€s_Lengflis += 1023.0; 
Integer_CodcL.Lengtihs -= 1; 

I 

1 

) 

else /* Code Phase Decreases with each 770 Doppler Cycles */ 
{ 

If (FractionaLChips >= 770.0 cycles) 
{ 

FractionaLChips — 770.0; 
Code_Lengtiis -= 0.5; 
If(Code_Lengths < 0.0) 
{ 

Code_Lengtihs += 1023.0; 
Integer_Code_Lengths -= 1; 

) 

} 

else If(Fractional_Chips < 0.0 cycles) 
{ 

Fractional^Chips +=» 770.0; 
Code_Lengtiis += 0.5; 
If(Code.Lengths > 1023.0) 
I 

CodeL-Lengths 1023.0; 
Integer_Code_Lengths 1; 

) 

) 

J 

Creation of titie Next FN Code Preposition Data 
The next time this particular satellite segment needs to be pre-positioned by 
the PN code generator, the FN Code Generator is given the integer count of half- 
chips stored in the Code^Length Register offset by the segment number of the Rl 
data. More specifically, Preposition_PN„(3ount = 128 * Se^Num + 
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Code_Length„Register; Where Seg^Num refers to the Rl data segment number (0 
through 15). Note that the multiply by 128 can be implemented by a simple bit 
shift of 7 places. Finally, the Rreposition_PN_Count needs to be compensated for 
any overflow or underflow, with a function such as: 

If (Preposition_PN_Count >= 1023.0) Preposition_PN_Cotint -= 1023.0; 
If (Preposition_PN_Count <= 0.0) Preposition_PN_Couiit +» 1023.0. 

In FIG. 4, the output of a PN replica code generator 320 is sampled 
and parallelized to two R3 registers 324 (only one of which is illustrated) in a 
nfianner similar to the input signal segmentation sequence. The number of 
samples in the R3 registers is either 128 samples or 126 samples depending on the 
segment length output of the Divl28/126 divider block 314. The R3 registers are 
coupled to the multiplier block 328 by a multiplexer 330. 

In FIG. 7, an exemplary ROM based PN code generator generates all 
128 states of the R3 register within the required 32 system clocks. The contents of 
shift registers 702 and 704 are deterministic, wherein each of 1023 states defines 
one particular bit of a selected code. The states of the shift registers are stored in a 
corresponding lookup table ROM. The Gl lookup table ROM 706 requires 1023 
bits (128 words by 8 bits), and the G2 lookup ROM 708 requires 2176 bits (128 
words by 17 bits). 

On one system clock cycle, the Gl and G2 ROM pair, in conjtmction 
with the parallel EXOR and MUX Block 710, create 8 parallel bits representing 8 
consecutive states of the output of the PN generator for a selected signal. The first 
bit coincides with the bit position (i.e., PN code bit state number) represented by 
the input address to the ROMs multiplied by eight. The R3 register captures 128 
bits. Each bit from the PN generator is duplicated twice (R3 contains a 2.046 MHz 
rate sampling of the 1.023 MHz PN generator), so that the G1/G2 and Parallel 
EXOR and MUX block must create 64 bits of the PN sequence for preloading. The 
R3 register is organized as 8 copies of a 16-bit long shift register. The last eight 
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copies contain 128 bits of the R3 register, while the first is a shift-register load 
register loaded by the G1/G2 ROM and Parallel EXOR and MUX block. 

The circuit parallel loads 16-bit shift registers. In a preload mode, 
the circixit successively loads 9 registers in nine clock cycles. After the shift 
registers are loaded, the state of the R3 register represents that portion of the 
selected PN code, flie l^t bit of the shift register being that bit coinciding to the 
closest state number (within 16 clock cycles of the target state). The shift registers 
are then clocked the remainder number of clock cycles to preposition the R3 
register into the desired starting state, the state computed by the code phase 
computer and delivered to the PN generator by the Preposition_PN_Count 
parameter. The top 7 bits of the propositioning count go directly to the ROMs. An 
address count of -1 through +8 is added to the address to generate the address 
looking up the contents of all 9 of the 16 bit shift registers that make up the R3 
register. The lower 4 bits (i.e,, the remainder) represent the integer number of 
clocks necessary to put the R3 register into the desired initialization state. 

As noted, there are two copies of the R3 registers, both driven by one 
copy of the G1/G2 ROM and Parallel EXOR and MUX block. In one embodiment, 
the two R3 registers alternate between a preload state and a run state. In a RUN 
mode, the R3 register is shifted once per clock. The next 16 bits of the code 
required are loaded into a LOAD register on the proper clock so that a continuous 
stream of PN bits are loaded into the R3 register. FIG. 8 is a more detailed block 
diagram of the Parallel EXOR and MUX block. To generate eight successive bits in 
one parallel clock cycle for any signal, the circuit duplicates the circuitry for 
exclusive OR-ing of any two bits from the G2 shift register along a 10-bit interval. 
Thus, two 10 bit data selectors and one exclusive OR gate are xised for every bit of 
the eight. In addition, the G2 shift register ROM generates a total of 17 total state 
bits so that the 10 to 1 selectors can view the proper ten bit range for each of the 8 
bits. 

In FIG. 4, the correlation is performed by a 128 tap high speed flash 
parallel correlator 332 that correlates sample by sample the contents of the 
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complex (two-bit I and two-bit Q) Rl register with the complex (one-bit I and one- 
bit Q) R2 register with the real one-bit R3 register, summing all taps to form a 
SUMI and a SUMQ output. 

The madiematical operation performed by the correlator can be 
described in FIG. 9a and the accompanying pseudocode as follows: Assume that 
the Rl register consists of 128 samples of I & called Rli[k] and Rlq[k], where k 
is a sample number within the Rl register; 0 <= k <= max. Likewise, the R2 
register consists of 128 samples of I & Q, called R2i[k] and R2q[k]; 0 <= k <= max. 
Finally, the R3 register consists of 128 samples of the real PN code sequence, 
called R3[k]; 0 <= k <= max. IN the exemplary embodiment, the value of max 
changes between 127 and 125 depending on whether the Rl register contains a 
128 sample Rl value or a 126 sample Rl value. The Rli and Rlq terms can take 
on the value of +1, +3, -1, and -3. The R2i and R2q can take on the values of +1 
and -1. The R3 values can carry the values +1 or -1. 

The flash correlator forms the product of sums as follows: 

For(k-0; k<max; k++) Sum += R3[k] * (Rl[k] * R2[k]); 

Expanding this expression to show the complex operations produce 

For (k=0; k< max; k++) Sum += R3M * [(Rli[k] + jRlq[k]) * (R2i[k] + jR2q[k])]; 

Expanding fiuiher and collecting in-phase and quadrature components produces: 

For (k=0; k< max; k++) Sumi +« R3[k] * [(Rli[k] * R2i[kl) - (Rlq[k] * R2q[k])]; [1] 
For (k=0; k< max; k++) Sumq R3[k] * [(Rli[k] * R2q[k]) + (Rlq[k] * R2i[kD]; [2] 

Finally, the complex product of Rl times R2 can be described as the 
product register, Pi[k] and Pqlk], where: 



Pi[k] - (Rli[k] * R2i[k]) - (RlqM * R2q[k]); 
Pq[k] = (Rli[k] * R2q|k]) + (Rlq[k] * R2i[k]); 



[3] 
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The product registers Pi and Pq are constant for each Rl and R2 
value and contain max elements. The final sum (Sumi and Sumq) can be written 
in terms of R3 times the product registers (Pi and Pq) as follows: 

For(k=0; k< max; k++) Sumi += R3[k] * Pi[k]; [5] 
For(k=0; k< max; k++) Sumq += RSfk] * Pq[k]; [6] 

To maximize the number of correlations performed in a time 
interval, these sums, Sumi and Sumq, in equations [5] and [6] are computed in 
one dock cycle, which may be pipelined, by one large block of logic that 
accomplishes the function. 

FIG, 9a illustrates the preferred embodiment of the flash correlator 
block. Equations [1] through [6] describe the mathematical operations performed 
on each element of the Product register. Pi and Pq, and the subsequent outputs 
Sumi and Sumq. The specific design of the flash correlator is highly dependent on 
the method of encoding the data associated with the Rl and R2 data inputs. The 
encoding of the I & Q samples of the Rl input samples is assumed to be: 

Two Bit Encoding Value 

00 +1 

01 +3 

10 -1 

11 -3 

And the One-Bit I & Q samples of R2 are encoded as: 

One Bit Encoding Value 

0 +1 

1 -1 

The One-Bit samples of R3 are encoded as: 

One Bit Encoding Value 

0 +1 

1 -1 



FIG. 9b also illustrates all 64 possible combinations of the bit 
patterns associated with the two bit I & Q Rl samples multiplied by the one bit I 
& Q samples of the R2 register, i.e., the product register 326. Thus it is not 
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necessary to encode the samples into twos-complement notation in order to form 
the iimer products of Rl * R2. All 128 outputs of the Pi[k] and Pq[k] logic block 
multipliers are summed in separate I & Q data paths to create the Sumi and Sumq 
outputs. Note that the output of the summer is truncated by one bit (see the 
output path in FIG. 9a) because the output of the flash correlator summer is 
always an even number. A logic block simply has to duplicate the truth table of 
FIG. 9b, which includes the divide-by-two embedded in the table. Coltimn 
"Pi_scaled'' is different from column "one element of Pi'' by a factor of two, and 
column 'Tq_scaled" is different from column ''one element of Pq" by the same 
factor of two. Encoding the four input states (+1, +3, -1, -3) of the Rl data in this 
manner and accomplishing the multiplication with logic elements avoids twos 
complement math, which requires one more data bit (3 each for I & Q Rl register, 
for example) in the data path and uses additional hardware elements. 

FIG. 9c is an exemplary correlator block diagram including a first 
and second multiplier blocks 900 and 902 corresponding to the multiplier blocks 
326 and 332 in FIG. 4, respectively. FIG. 9c also illustrates summation of the 
outputs of the 128 parallel correlators, depending on the segment number at a 
sunmaation block 904, which corresponds to a sxmimation block 333 in FIG. 4. In 
the exemplary embodiment, the Rl register output is 128 samples long for all 
signal segments except for the last segment, which is 126 samples long. The 
outputs of the correlators 127 and 128 are ignored during summation of the 16th 
signal segment under control of the Segment Select signal. 

The correlation process is illustrated schematically in FIG. 10a. The 
input samples from the Rl register and the R2 register are multiplied to produce a 
product register P. The product register P is then held constant while a pre- 
determined range of code phase delays of the replica PN generator (stored in the 
R3 register) is applied to a final multiplier. All 128 states of the multiplier outputs 
are then summed together in the sununer block 333. 

The R3 register contains 128 states of the replica PN code for a 
particular code phase delay. For each code phase delay, a coherent RAM memory 
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334 is used as a large number of separate accumulators, one for each of tiie 
possible code phase delays tested. In a half-chip mode, the contents of the R3 
register is advanced one half-chip delay per clock cycle, and the coherent RAM 
334 is advanced one address so that each memory location in the coherent RAM 
represents successive one half-chip delay trials of code phase delay. For example, 
tiie results of the correlation of the R3 register for delay zero is written into the 
delay-zero accumulator address, the results of the correlation of the R3 register 
for delay one (half-chip different) is written into the delay-one accumulator 
address and so-forth. Up to 2046 half-chips are tested per signal or satellite 
representing all possible code phase delays, or a pre-determined number of code 
phase delays are tested, filling only the corresponding number of coherent RAM 
memory locations. In a one-chip mode (described later and shown in FIG. 10b), 
successive coherent RAM addresses represent one chip spaced delays. 

Every one of the 2046 possible states of the R3 register form a 
unique SUMI and SUMQ output that is summed into the coherent accumulation 
RAM 334 in successive memory locations. The receiver is programmable so that 
coherent accumulation is over N milliseconds, wherein each satellite is 
programmable over a different value of N by storing an N value for each satellite 
searched and applying the N value to the coherent accumulator when the 
particular satellite is being searched. 

The parallelization of the receiver may be extended in architectures 
where more than 4096 search bins by merely including rtiultiple copies of the 
receiver or portions thereof or by increasing the size of memories 70 and 80 and 
increasing the system clock rate accordingly. For example, doubling the word- 
count of memories 70 and 80 would result in 8192 memory locations. This would 
give the receiver the ability to compute 8192 unique satellite/ Doppler/ code 
phase trial bins, instead of the original 4096. In order to process all 8192 
satellite/Doppler/code phase trial bins within one update cycle of the Rl register 
(1/16 of one millisecond), tihe system clock rate would have to be greater than or 
equal to 128 * 1.023 MHz. Another way to do this is to change the segment length 
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to more or less than 128 samples. For example, if the segment length is made to 
be 256 samples long, then the system would segment the 1 millisecond long FN 
code by performing partial correlations over seven 256 sample-long segments and 
one 254 sample long segment in order to cover the 2046 half-chips of code phase 
test space. It also gives the receiver more time per Rl update cycle to perform the 
sequential partial correlations, allowing for a system clock rate that is half of that 
originally to accomplish the same number of correlations (4096), or by allowing 
for processing twice the number of correlations (8192) if the clock rate is held to 
the same value. 

Once a spread spectrum signal segment is stored into the Rl 
register, subsequent processing is independent of input clock rate. It is therefore 
possible to process the entire code phase space for 1, 2, 4 or 8 signals or satellites 
as long as the correlator, FN generator, Doppler NCO and accumulation RAM 
run fast enough. For example, a full code phase space search on 8 satellites may 
be performed if the FN code generator, R3 register, and Coherent RAM are 
clocked at (8 * 2048 correlations) / (1/16 * 0.001 sec) MHz, the accximulation RAM 
is expanded to hold 8 * 2048 words, and by timing accordingly the multiplexers 
and coherent accumulation RAM. 

The exemplary architecture exhibits clock-rate scalability, which can 
exploit increased clock speeds associated with semiconductor process shrinks. As 
noted, the correlator arrays can be programmed to run at a variable dock rate 
depending on the total nimiber of search (code phase and Doppler) bins 
necessary for a particular problem. 

While computing the correlation tests for up to 2046 possible delays 
for one satellite, the contents of the Rl register remains stationary during the 
collection time of the next segment (approximately 1/16 KHz of time in the 
exemplary embodiment). During this period of time, a predetermined number of 
code phase and/ or Doppler bins are searched for the segment collected 
previously. When aU code phases have been searched for one Doppler bin, a new 
value can be loaded into R2 to search a new Doppler bin. 
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In FIG. 11, the coherent integration block accumulates up to 4096 in- 
phase and quadrature correlation sums in a memory array 102 consisting of 10 or 
more bit-wide integrators for SUMI and SUMQ signals. The array is segmented 
into blocks corresponding to a particular satellite PN code, code phase range, and 
Doppler frequency. Each of the 4096 I & Q memory locations may be considered 
as one accumulator, which holds the correlation results for one satellite, one code 
phase delay (in half-chips), at one Doppler frequency. The array 102 is preferably 
configured with separate read/write input paths to pipeline processing in a single 
dock cycle per memory address update. Other configurations are also possible as 
long as the processing is limited to one clock cycle per memory element update, 
including, for example, a dual single port memory scheme where A and B memory 
is used to separate the read and write processes, doubling the clock frequency on a 
single port memory to allow one cycle read, one cycle write, and a true dual-port 
memory design. 

The flash correlator sum corresponding to one particular code phase 
delay, one satellite, at one Doppler frequency is input to the coherent memory via 
SUMI and SUMQ inputs. Each clock cycle delivers correlation sums for one 
correlation on the two inputs SUMI and SUMQ. The flash correlator outputs are 
truncated a fixed number of bits as specified by a pre-shift parameter at 104 to 
reduce dynamic range, tmder control of the controlling processor or DSP. The pre- 
shift value is configurable for each satellite searched by storing a pre-shift value 
for each satellite searched and by multiplexing the pre-shift value to the shifter 
when the particular satellite is being searched. The address sequencing starts at 
address 0 and increments one address for every correlator sample on SUMI and 
SUMQ. The 4096 words of the coherent memory array are updated before the next 
Rl sample is available. 

The coherent accumulators sum over an integer number of 
milliseconds corresponding to a selectable PDI register setting for each satellite. 
The integration is integer-fixed point, with fixed point scaling of 2-^ on the input 
quantities SUMI and SUMQ. The coherent integration period can vary from 1 to 
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20 ms (one GPS navigation message bit time) in increments of 1 ms. For a 1 ms 
coherent integration, for example, sixteen successive Rl registers (corresponding 
to signal segments 1-16) are processed and summed into the coherent integration 
memory before the final sxmi is communicated to the non-coherent integration 
block 336 in FIG. 4. At tfie end of 1 ms, the coherent RAM segment is cleared. The 
coherent integration period can also extend beyond one bit-time (20 ms) provided 
the system has knowledge of a particular sequence of navigation bits. Precise local 
time knowledge to 2 ms is required for 20 ms coherent integration. 

On the 1st update to the memory (i.e., on the l^t iteration of a 
coherent sum), an AND gate 106 driving a port of an adder 108 is cleared, forcing 
zeros into that port of the adder. The 1^* correlation results from the correlator are 
loaded directly into memory. Subsequent memory sums are formed by passing 
previous accumulations to the adder with the AND gate to form 4096 integrators. 
One Rl segment after the last sum of a PDI interval the AND gate 106 output is set 
to zero. The read function operates normally, fetching the desired sum from 
memory. The sum is passed to and captured by the non-coherent integrator block, 
discussed below. Finally, the l^t sum of the next coherent interval is written to the 
memory address, beginning the process anew for the next coherent integration 
period. 

In some modes of operation, the coherent integrator may overflow 
its 10 or more bit-wide dynamic range, for example, if the PDI is set to 20 
milliseconds or higher and/ or if the Pre-Shift scale factor is set low (example, to 2^ 
or 2-1), and signals are stronger than expected. Accumulation overflow may be 
prevented by adjusting the scale factor for each of the plurality of spread spectrum 
signals. In one embodiment, at least some results of correlating each of the spread 
spectnun signals are scaled by different scale factors, for example by lowering the 
PDI and/ or by raising the Pre-Shift parameter. 

In FIG. 11, an overflow detector 110 has an input coupled to the 
input to the memory array 102, and outputs coupled to tihe adder 108 and to 
overflow count update logic 112. When an overflow condition occurs, the 
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integrator output is set to a maximum or minimum value. A counter 114 then 
records the number of overflow conditions, defined by one of 64 possible bins 
(bins here refer to the number of peak detector bins, Doppler registers) and sums 
the total as long as tiie non-coherent integrator is running. For example, if the 
non-coherent integrator is programmed to run for 200 ms and the PDI is set to 10 
ms, then the overflow counter for that particular bin counts the total number of 
coherent integrator overflows in ihat bin during the entire 200 ms non-coherent 
integration ''run" time. In this example, up to 20 overflows may be detected. The 
controlling processor can then read the output as part of the peak detector output 
to verify that none or only a few overflows occurred during the dwell. 

In FIG. 12, a non-coherent integrator accumulates up to 4096 signal 
magnitude sums in a memory array 122 segmented the same as the coherent 
integration block discussed above. Each accimiulation sum holds the correlation 
results for one signal or satellite, one code phase delay (in half-chips) at one 
Doppler frequency. As in the coherent integrator, each memory block uses a 
fixed-point scaling, for which a fixed-point scale factor is associated with the entire 
block. The coherent integrator provides 10 or more -bits-wide coherent I & Q 
inputs to a JPL magnitude detector 124, which computes an approximation to I & 
Q signal magnitudes using the algorithm illtistrated. The inputs are synchronous 
with the end of a particular segment's coherent integration interval. 

The non-coherent integrator also includes an adder cell 126 wiiii a 10 
or more bit wide port from the magnitude detector 124 and an 18 bit input port 
from an UP shifter 128 that shifts contents of the non-coherent accumulation 
memory array 122 (10 or more bits wide) to an equivalent magnitude. The scaled 
output of the non-coherent memory is added to a new magnitude (most recent 
coherent sum), which is shifted by a dov^-shifter 132 to the 10 or more bits of 
dynamic range of the memory array. 

A priority encoder logic block 134 records an output magnitude of 
the dovm-shifter, and inputs a latched PE signal to a scaling logic block 136 to shift 
a scale factor when necessary to prevent overflow conditions on subsequent 
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accumulations. Particularly, when any memory element has an active MSB bit, the 
scaling logic 136 increments the scale by one cottnt so that tiie next time the 
segment is accessed overflow is prevented by applying the proper scale. The 
scaling logic updates the contents of the non-coherent global scale factors RAM 
138 after a PE status flag has been latched, before the next segment is processed. 
The non-coherent global scale factor RAM contains both a ''current" scale and a 
"next" scale quantity, both of which are initialized to zero at the start of a non- 
coherent integration. Each scale quantity represents a number of bit shifts for the 
corresponding UP and Down shifters. The up shifter 128 gets the "current'' scale 
factor and the down shifter 132 gets the "next" shift quantity. Zero means no shift, 
one means 1 bit shift, etc. The "next" scale is updated by adding the contents of 
the latched PE value to the previous "next" scale factor. Note that this can also be 
accomplished by using one "current" scale factor storage plus a sin^e bit which 
indicates whether the next up-shift should be one less than the next down-shift. 
This is in contrast to storing a current and next exponent value and saves memory 
area in the scaling circuit. In either case, the entire correlator block is scaled by one 
exponential value or block value. In one embodiment, all subsequent accumulated 
magnitudes for one or more of the predetermined phase delays are scaled with a 
common scaling factor if it is determined that the subsequent accximulation of the 
greatest accumtdated magnitude will result in an accumulation overflow. 

In FIG. 4, a peak detector and output registers 338 are coupled to the 
non-coherent memory to aid the controlling processor in determining detection 
conditions and to reduce the throughput required to scan the integrator outputs 
for signal detection. The peak detector and registers also provide signal tracking 
(code and carrier) paths. In addition, the detector and registers relieve the 
processor from reading all 4096 words of non-coherent integration RAM and from 
testing for signal detection conditions on each segment of data in memory. The 
peak detector and output registers scan tiie non-coherent integration RAM once 
per RAM update. 
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In FIG. 13, the peak detector comprises a register 142 for storing the 
address of the signal maximum in the segment (P), a register 144 for storing the 
address of the next signal maximum in the segment (MP), a register 146 for storing 
the address of the next, next signal maximum in the segment (NNP), a register 148 
for storing the sum of all of the signal magnitudes in the segment (SUM_MAG), a 
register 150 for storing the maximum signal magiutude (P^DAT), a register 152 for 
storing the next maximum signal magnitude (NP.DAT), and a register 154 for 
storing the next, next maximum signal magnitude (NNP^DAT). A 64 word by 86- 
bit output register RAM 156 stores these parameters. Each RAM word coincides 
with one segment of data (one Doppler, one Satellite, and one code phase delay), 
and the bit positions associated with the memory coincide with the various 
elements. The output register RAM, when read by the controlling mdcroprocessor, 
is mapped into 16 bit words (P, NP, NNP) and 32 bit words (SUM_MAG), and 16 
bit words (P^DAT, NP^DAT, NNP^DAT), so tiiat the least significant bit of each 
parameter shows up on tiie least significant data bus bit. Other values, such as 
coherent overflow count and non-coherent scale factors, can be updated in the 
output register at the same time the peak detector data is transferred to the output 
register RAM. 

The sum of all signal magnitudes in the segment may be used to 
compute an approximate peak signal to average noise. The peak detector and 
output register function may run during the process of updating the non-coherent 
integrator (i.e., the 1/16 millisecond segment of time corresponding to the 
termination of the coherent integration interval), or may nm anytime during the 
period before the next non-coherent integrator update. In one embodiment, an 
approximate phase delay determination corresponding to a phase delay of the 
greatest magnitude is confirmed by determining whether a magnitude of the 
phase delay difference of first and second greatest correlation magnitudes is equal 
to one unit of phase delay. 

A coherent RAM capture function, part of the output register block 
91 shown in FIG. % can be used to capture and buffer (to be read later by tiie 
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controlling processor) the coherent sums from one or more coherent memory 
spaces corresponding to a the actual code phase delay for one or more detected 
satellites. The coherent RAM capture buffer can then be read by the controlling 
processor in order to directly demodulate the satellite transmitted 50 BPS data 
sequence using traditional methods of demodulating the bi-phase data 
modulation. It is through this path that the receiver can collect real-time satellite 
transmitted ephemeris, clock correction, almanac, UTC offset, and ionospheric 
delay data as well as precise time encoded in the HOW word. 

The search for all phase delays of a single spread spectrum signal at 
one Doppler frequency is discussed. The PN code generator is clocked at a rate at 
least 32 times its normal rate of 1.023 MHz and the R3 register is clocked at least 
twice that to produce an R3 register clocking rate of at least 32 * 1.023 MHz. At 
this rate, the PN code generator rolls one complete cycle in half the time it takes to 
collect the next signal segment in Rl, 1/16* of 1 millisecond, and the replica code 
segment of the R3 register takes on all possible 2046 states (delays). 

In the processing flow diagram of FIG. 10a, while the Rl and R2 
registers store the signal and NCO segments as discussed above, the PN 
generator is clocked 1023 times and the R3 register is clocked 2046 times so that 
all possible 0.5 chip samples are compared to the Rl times R2 complex product 
formed at multiplier block 326. Each of the 2046 states of the R3 register produce 
corresponding SUMI and SUMQ correlator outputs. In FIG. 4 , these outputs are 
delivered to the N-millisecond coherent accumulation RAM 334 via the adder. 
Each of the 2046 outputs of the SUMI and SUMQ correlator outputs are added to 
each of the 2046 possible in-phase and quadrature memory words of the N- 
millisecond coherent accumulation words. Each word in the coherent 
accumulation RAM represents one each of the 2046 possible 0.5 chip PN phase 
delays. For each one of the sixteen segments, the Rl and R2 registers store and 
hold their corresponding data. The R3 register takes on all 2046 code phase states 
and the output of ttie correlator during each of these states is added to flie 
successive addresses in the coherent accumulation RAM. When the next segment 
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is collected in the Rl and R2 register, the process repeats. After all 16 segments 
are processed, the coherent accumulation RAM contains the correlation sum for 
all possible 0.5 chip PN phase delays for 1 millisecond of processing. The 
coherent accumulation sum can continue for some programmable integer number 
of milliseconds. 

The correlator may also generate correlation results for more than 
one satellite. In FIG. 4, multiplexers 328 and 330 select more than one source of 
Doppler signals and more than one source of PN code signals, respectively. By 
properly timing the multiplexers, the correlator can compute more than one 
correlation output for a shortened code phase test range. The design is such that 
only one time-shared PN code generator and only one Doppler NCO generator 
are necessary to accomplish the processing over all possible satellite signals, by 
time multiplexing. 

The 3rd entry in the Table of FIG. 3 illustrates, for example, two 
satellites each searched at 4 different Doppler frequencies and 512 half-chips of 
code phase uncertainty space. It is assumed that the code phase of the signal is 
known with sufficient accxiracy to place it within 512 half-chips of delay. This is 
possible under many conditions of autonomous GPS and in assisted operation 
modes. The controlling processor configures the receiver for a particular mode by 
writing to control registers. The coherent and non-coherent accumulation RAM is 
divided into 8 segments, each 512 words long. Each of the 8 segments can be 
mapped to the 512 half-chips of code phase desired by timing the multiplexer 330 
and by programming a pre-correlation shift into the PN code generator 320 
driving the R3 register 324. This maps 8 particular code phase search ranges into 
the 4096 word available accumulation RAM, the available code phase search 
range being 4096/N, where N is the nmnber of satellites observed. When N=l or 
N=2, the maximum code phase search space is 2046 half-chips per Doppler due to 
the PN code sequence length. 

The exemplary embodiment exhibits 0.5 chip sample spacing. The 
sample spacing may be reduced to 0.25 chip or 0.125 chip spacing by increasing 
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the clock rate of the circuits driving the Rl, R2, and R3 registers by factors of 2X 
or 4X, respectively. Note that this reduces the range of code phase searching by 
the same factor, for example, if 1023 chips are tested at 0.5 chip spacing, then by 
doubling the sample rate of the Rl, R2 and R3 register driving circuits, the system 
is capable of testing approximately 511 chips of code phase delay at 0.25 chip 
spacing to produce a more accurate measurement of code phase because Ihe 
system will create more closely spaced correlation samples. The sample spacing 
may be increased also to single chip spacing to extend the number of searchers by 
causing a 0.5X clock rate on the ISIP output samples, thus extending the number 
of searchers beyond 4096. For example, the following table defines the four 
possible states of the receiver data input rate. 

Receiver Data Input Rate Sample Spacing, in Chips 

1,023 MHz IChip 
2.046 MHz Chips 
4,096 MHz Vi Chips 

8.184 MHz 1/8 Chips 

The total mmiber of RAM cells remain at 4092, but the allocation of 
the measurement RAM is in steps of 1, y% Va, or 1/8 chip spacing. In one 
embodiment, as many as four satellites can be tested for complete code phase test 
reinge simultaneously (at one chip spacing). In another embodiment, as many as 
12 satellites can be tested with reduced phase delay as long as the total number of 
phase delays tested does not exceed 4092 one-eighth chips total. 

An alternative method of accomplishing Vz chip or 1 chip spacing is 
to increment the phase of the PN code stored in the R3 register on each 
subsequent clock cycle by two half-chips per iteration, whereby only 1023 
possible phase delays are created across the entire PN code length, thus allowing 
for up to four full phase delays to be tested by the 4092 correlators. For this 
method, the Rl register stores samples at Vi chip spacing, regardless of whether 
the R3 register is advanced by or 1 chip per clock. 
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FIG. 10b shows how this is accomplished. As discussed earlier, the 
input samples from the Rl register and the R2 register are multiplied to produce a 
product register P. The product register P is then held constant while a pre- 
determined range of code phase delays of the replica PN generator (stored in the 
R3 register) are applied to a final multiplier, all 128 states of flie multiplier 
outputs are then summed together in summer block 333. 

The R3 register contains 128 states of the replica PN code for a 
particular code phase delay. For each code phase delay, the coherent RAM 
memory 334 is used as a large number of separate accumulators, one for each of 
the possible code phase delays tested. In a one-chip mode, the contents of the R3 
register are advanced two half-chips of delay per clock cycle, and the coherent 
RAM 334 is advanced one address so that each memory location in the coherent 
RAM represents successive one chip delay trials of code phase delay. For 
example, the results of the correlation of the R3 register for delay zero is vmtten 
into the delay-zero accumulator address, the results of the correlation of the R3 
register for delay one (one-chip different) is written into the delay-one 
accumulator address and so-forth. Up to 1023 full-chips are tested per satellite 
representing all possible code phase delays, or a pre-determined number of code 
phase delays are tested, filling only the corresponding number of Coherent RAM 
memory locations. 

Every one of the 1023 possible states of the R3 register form a 
unique SUMI and SUMQ output that is stmmied into the coherent accumulation 
RAM 334 in successive memory locations. The receiver is programmable so that 
coherent accximulation is over N milliseconds, each satellite programmable over a 
different value of N by storing an N value for each satellite searched and 
applying the N value to the coherent accumulator when the particular satellite is 
being searched. This alternative mode requires that signal samples be loaded into 
the Rl register at y^ chip spacing, but the R3 register, whose spacing is y2 chip, is 
advanced at one chip (two V2 chips) per system clock cycle. 
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In one embodiment, less than half chip spacing is generated using V2 
chip Rl and R2 spacing by assigning multiple correlation trials to the same 
satellite, each correlation trial shifted in time a small code phase offset (i.e., less 
than y% chip spacing) to the segment FN code generator. In FIG. 14, for example, 
the first code replica signal segment produces outputs at 0, 0.5, 1.0, 1.5 chip 
spacing. The second code replica signal segment is offset from the first signal 
segment one by l/8th chip, producing outputs at 0.125, 0.625, 1.125, 1,625 chip 
spacing. The third code replica signal segment is offset by 2/8*^ chip, producing 
outputs at 0.250, 0.750, 1.250, 1.750 chip spacing. The fourth code replica signal 
segment is offset by 3/8ths chip, producing outputs at 0.375, 0,875, 1.375, 1.875 
chip spacing and so forth. In this way, 1/8 chip spacing outputs can be 
produced. Generally a difference between the first and second time shifts of the 
replica signal is a fractional multiple of a time interval of the sample rate. 

Offsetting by less than y% chip delay is accomplished by setting the 
Delta_Cp parameter to a smaU quantity of less tiian 770 carrier cycles. For 
example, to implement 1/8 chip offsets, the Delta_Cp parameter is set to 1/8 * 
1540, or 192.5 carrier cycles. This offsets the corresponding 
Fractional_Code_Phase[i] term in the Code Phase Computer block for the affected 
satellite, each test bin offset by a successively increasing amoimt of 192.5 carrier 
cycles, so that the bins that are offset further will roll over the 770 carrier cycle 
limit sooner, thus appl3dng a code phase correction (of a discrete Vi chip stepsize) 
sooner than the other lesser offsets. That is, the code phase correction steps are 
still y2 chip physical steps, but the time in which one bin applies a correction is 
different from a subsequent bin that is proportional to the Delta_Cp parameter, 
which causes an average offset representing a code phase delay that is much less 
than chip on each of the affected bins. 

In FIG. 4, the coherent sxmis after N milliseconds can then be passed 
ihrough a magnitude detector and added to an M millisecond non-coherent 
acamiulation RAM 326. The number M is also programmable. The larger the M 
interval, the higher the signal processing gain. The N millisecond interval of 
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coherent integration sets the bandwidth of the frequency search as 1/ (0.001 * N) 
Hz. 

In an N millisecond coherent/M millisecond non-coherent 
integration mode of operation, the number of searchers and code phase and 
Doppler for each searcher are programmable in the receiver. When the time of 
arrival (TOA) of data bit edges is known, the receiver may operate in an N 
millisecond predicted bits coherent integration/M millisecond non-coherent 
integration mode, where N is typically greater than or equal to 20 milliseconds 
(GPS bit time) up to and including 200 milliseconds. In a time Offset Search Mode, 
upon signal detection, the receiver searches for a particular 50 BPS data pattem 
sequence to arrive, as described for example in copending patent United States 
patent Application No. 09/539,137 entitled "Method and Apparatus For 
Determining Time in A GPS Receiver", assigned commonly with the present 
application and incorporated by reference herein. 

In a cpntinuous tracking mode, the receiver continuously tracks and 
demodulates the 50 BPS satellite transmitted data directly from all satellites in 
view. In a quick search mode, the receiver executes a rapid scan for aU visible 
satellites without having the processor intervene imtil the entire sequence is 
complete, wherein one interrupt is generated at the end of the scan. The receiver 
may also resume the detection process where it left off when power is re-applied. 

The receiver also operates in a bit synchronization and demodulation 
mode wherein data message bit, for example the GPS 50 BPS navigation data 
message, edges are located for improved signal to noise performance. In this 
mode, knowledge of the time of arrival of the transmitted data bit edges is not 
known, but the code phase delay to a satellite of interest has been already been 
determined. 

In FIG. 15, in the exemplary embodiment, 640 (about 15 percent) of 
the 4096 correlator bins are required to determine the bit S3nichronization time 
delay for one or more satellites. The parameter tiiat varies between vertical bins is 
a time associated with the integration and dump process, all integrate and dump 
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processes are set to a pre-detection integration (PDI) value of 20 milliseconds 
consistent with the data bit time. The delay of the dump command for each of the 
20 integrators changes by one millisecond per vertical bin so as to cover all 
possible delays of the data bit change. The "Doppler" and "CP^OFFSET" 
parameters illustrated in FIG. 2 are not varied vertically. Since the PDI parameter 
is 20 milliseconds, only one of the vertical bins is aligned perfectly with the 
transmitted 50 BPS data edge. The start of integration for all 20 bins is 
synchronized to the Rl segment closest to PN code phase 0. That particular bin 
will integrate non-coherently to a value that is greatest compared to those bins that 
are misaligned once many data transitions are observed. Also, only one of the 
vertical bins will be aligned so as to maximally cancel in the coherent sum (when 
data transitions are present). Another measure of bit-alignment is by locating the 
bin that integrates to a minimum after many data transitioiis are summed. When 
the maximum to minimxrai non-coherent integrator siun is approximately one-half 
tile data bit time, 10 ms in the exemplary embodiment, bit-synchronization is 
confirmed. In some applications, for example, those with high noise content, a 
pass threshold may be established at 9, 10, or 11 milliseconds or some other range 
that is approximately half the data bit time. Using this method, the receiver can 
find bit-sync down to signal levels approaching 20 dB-Hz, at least 10 dB better 
than is possible with traditional bit sjoic methods. 

Alternate methods may be used to confirm bit sjmchronization. For 
example, the difference between the maximum sum and the sirni which began 
integration 1 ms earlier can be computed. A second difference can be computed 
between the maximum sum and the sum which began integration 1 ms later. In 
one embodiment, if these two differences are within some tolerance of each other, 
say 10% of the expected correlation difference, bit-synchronization is confirmed. 
This alternative method does not depend upon a minimum simi, which may be 
susceptible to noise. Other techniques can use other sums near the maximum 
sum. 
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In FIG. 16, a plurality of coherent correlations at the determined code 
phase delay between the first spread spectrum signal and a first replica signal are 
performed for a time interval corresponding to the data bit time (20 ms in the 
example). The plurality of coherent correlations, 0-19 in FIG. 16, correspond in 
number to the integer number (20) of the repeat time of the pseudorandom code 
bits. Each of the coherent correlations is offset relative to a previous correlation by 
the repeat time (1ms in FIG. 16) of the pseudorandom code bits. A magnitude of 
each of the plurality of coherent correlations is deternuned, and a plurality of non- 
coherent sxims of magnitudes for each of the plurality of coherent correlations are 
generated over at least two data bit times (20 ms). The plurality of non-coherent 
sums of magnitudes correspond in munber to the integer number of the repeat 
time of the pseudorandom code bits. The results of the sequential partial coherent 
correlations are stored in a corresponding plurality of memory locations 
corresponding in number to the integer number of the repeat time, and the 
stunmed sequential partial correlation results are stored in the plurality of 
memory locations. 

In FIG. 16, the non-coherent signal magnitude is plotted vertically 
relative to the time offset. The peak energy exists at bin 6 and the minimum 
energy exists at bin 16 at the corresponding time offeets, the difference of which is 
approximately 10 ms (one-half flie data bit time), which confirms that bit-sync 
detection has occurred. The offset time of the non-coherent sum having tiie 
maximum magnitude corresponds to the bit synchronization offset time of the 
data bit message. A local time correction may be determined dependent on the 
arrival time of the data bit edge and the local time of a real-time clock. 

Once bit sync time has been determined for one satellite using the 
technique discussed above (or any other method based on some other correlator 
architecture, including the prior art correlators cited in the instant application, 
among others), knowledge of the time of arrival of the data bit edge for one 
satellite can be used to time synchronize the start of flie 20 millisecond integrate 
and dump process, or coherent integration, in the coherent integrator to integrate 
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over the entire data bit time, avoiding the data transitions during the integration 
process^ and maximizing the signal to noise ratio. 

Generally, data bit synchronization of a receiver with a plurality of 
signals based on a known bit synch offset time for one signal, approximate 
receiver and signal source locations, and signal source clock correction parameters 
for each of the plurality of signal may be performed by determining a propagation 
time between iiie receiver and sources of each of the plurality of signals, 
determining a clock error corrected propagation time from the approximate 
position of the receiver to the source of each of the plurality of signals based on the 
corresponding propagation time and the corresponding signal source clock 
correction parameters, and determining an adjusted bit sync offset time for each of 
the plurality of signals for which bit S3mc offset time is not known based on Ae 
corresponding clock error corrected propagation time for each signal and based in 
the known bit synch offeet time and on the clock error corrected propagation time 
of the signal for which bit synch offset time is known. 

In one embodiment, the signals are satellite based spread spectrum 
signals and the signal source location is determined by derivation thereof from 
either ephemeris or almanac data and satellite time. 

The clock error corrected propagation time, PTCp], for each of the 
signals of interest may be determined by computing PTCp] = VT[V\ + Cpf), where 
PT(I) is the corresponding propagation time and C(I) is a corresponding signal 
source clock correction based on the signal source dock correction parameters. 
The propagation time, PTp], is determined from the approximate position of the 
receiver to each of the signal sources by dividing the range, R\J], between the 
receiver and the signal source by the speed of light. 

The adjusted bit sync offset time, BSOT[I], to each of the plurality of 
signal sources for which bit sync offset time is not knovm is determined by 
calculating BSOT[I] = BSOTpCj + (PTCIIl - PTCpq), where BSOT[K] is the known 
bit synch offeet time and PTCpC] is the a clock error corrected propagation time for 
the signal for which bit synch offset time is known. In one embodiment, each of 
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the plurality of signals is integrated coherently for 20 milliseconds, and a start time 
of each coherent integration is offset in relation to the corresponding bit synch 
offset time, BSOTP]. 

In the exemplary embodiment satellite based spread spectrum signal 
embodiment, underflow or overflow is corrected so that BSOTp] is always in a 
range of 0 to 20 milliseconds. This can be accomplished with software using a 
simple algorithm, for example: while(BSOT[I] > 20) BSOipi = BSOTH - 20; and 
while(BSOT[I] < 0) BSOTp] = BSOTp] + 20. 

In the exemplary embodiment, these steps will time-align the 
beginning of the 20 millisecond coherent integration intervals for all satellites 
based on the bit-sync offset time from one satellite, thus maximizing the signal 
processing gain for all satellites. 

Although the exemplary embodiment and applications are discussed 
in the context spread spectrum signals transmitted from satellites, those having 
ordinary skill in the art will xmderstand that many of the methods and 
architectures of the present invention are applicable for searching and 
synchronizing spread spectrum signals having other origins, for example those 
from terrestrial based communications systems. 

While the present inventions and what is considered presently to be 
the best modes thereof have been described in a manner that establishes 
possession thereof by the inventors and that enables those of ordinary skill in the 
art to make and use the inventions, it will be understood and appreciated that 
there are many equivalents to liie exemplary embodiments disclosed herein and 
that myriad modifications and variations may be made thereto without departing 
from the scope and spirit of the inventions, which are to be limited not by the 
exemplary embodiments but by the appended claims. 

What is claimed is: 
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CLAIMS 

1. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

receiving a first spread spectrum signal; 

forming a first signal segment by segmenting a first portion of the 
first spread spectrum signal; 

forming a second signal segment by segmenting a second portion of 
the first spread spectrum signal after forming the first signal segment; 

partially correlating the first signal segment with a corresponding 
first replica signal segment for all phase delays in a time interval not greater than 
that required to form the second signal segment; 

storing first partial correlation results for all phase delays of the first 
signal segment in corresponding memory locations. 

2. The method of Claim 1, 

forming a third signal segment by segmenting a third portion of the 
first spread spectrum signal; 

partially correlating the second signal segment with a corresponding 
second replica signal segment for all phase delays while forming the third signal 
segment; 

storing second partial correlation results for all phase delays of the 
second signal segment in corresponding memory locations. 

3. The method of Claim 2, forming summed partial correlation 
results for all phase delays by adding the first and second partial correlation 
results, storing the siunmed partial correlation results for all phase delays in 
corresponding memory locations. 



• 

wo 03/017503 



PCTAJS02/22961 



41 

4. The method of Qaim 1, separating a last signal sample of the fixst 
signal segment from a first signal sample of the second signal segment by a single 
sample. 

5. The metfiod of Qaim 1, partially correlating the first signal 
segment with the corresponding first replica signal segment for all phase delays in 
one chip or half-chip increments based on a processor controlled input. . 

6. A method for code phase searching a spread spectnmi signal 
having a repeating sequence of bits, comprising: 

receiving a first spread spectrum signal; 

forming a plurality of signal segments having unequal numbers of 
samples by segmenting the first spread spectrum signal; 
storing each signal segment; 

partially correlating each signal segment with a corresponding first 
replica signal segment for a predetermined number of phase delays in a correlator. 

7. The method of Qaim 6, communicating a segment length signal to 
the correlator for at least some of the signal segments partially correlated. 

8. The method of Qaim 6, storing partial correlation resxilts for each 
of the predetermined phase delays of each signal segment in corresponding 
memory locations. 

9. The method of Qaim 6, forming summed correlation results for 
each of the predetermined phase delays by adding partial correlation results for 
each of the signal segments, storing the summed correlation results for each of the 
predet^mined phase delays in corresponding memory locations. 
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10. The method of aaim 6, storing each signal segment in a first 
register, partially correlating one signal segment while forming a subsequent 
signal segment, storing the subsequent signal segment in the first register after 
correlating the previous signal segment. 

11. The method of Qaim 6, separating a last signal sample of one 
signal segment from a first signal sample of a subsequent signal segment by a 
single sample. 

12. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals; 
forming a plurality of signal segments having unequal numbers of 
samples by segmenting the plurality of spread spectrum signals; 
storing each signal segment; 

partially correlating each signal segment with corresponding 
segments of a plurality of replica signals for a predetermined number of phase 
delays, 

each of the plurality of replica signals corresponding to one of the 
plurality of spread spectrum signals. 

13. The method of Qaim 12, partially correlating each signal 
segment with corresponding segments of a pliurality of replica signals for a 
predetermined nimiber of phase delays virtually in parallel by sequentially 
correlating at a rate greater than a rate at which the plurality of spread spectrum 
signals are received. 

14. The method of Claim 12, partially correlating each signal 
segment wilh corresponding segments of the plurality of replica signals for the 
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predetermined number of phase delays in one chip or half-chip increments based 
on a processor controlled input. 

15. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals; 
forming a plurality of signal segments by segmenting the plurality of 
spread spectrum signals; 

storing each signal segment; 

partially correlating each signal segment with corresponding 
segments of a plurality of replica signals for all phase delays over the repeating 
sequence of bits, 

each of the plurality of replica signals corresponding to one of the 
plurality of spread spectrum signals. 

16. The method of Claim 15, forming summed partial correlation 
results for all phase delays of the plurality of spread spectrum signals by adding 
ttie partial correlation results for each of the plurality of signal segments correlated 
with corresponding segments of a particular replica signal, the partial correlation 
results for each segment added to the partial correlation results for a previous 
signal segment upon completion of partially correlating each signal segment 

17. The method of Qaim 15, separating a last signal sample of one 
signal segment from a first signal sample of a subsequent signal segment by a 
single sample. 



18. A method for code phase searching a plurality of spread 
spectrum signals, having repeating sequences of bits, from corresponding signal 
sources, comprising: 
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identifying attributes for a plurality of spread spectrum signals 

received; 

determining a code phase search range for each of the spread 
spectrum signals based upon attributes identified for the corresponding spread 
spectrum signals; 

forming a plurality of signal segments by segmenting the plurality of 
spread spectrum signals; 

partially correlating each signal segment with corresponding 
segments of a plurality of replica signals for the code phase search range 
determined, 

each of the plurality of replica signals corresponding to one of the 
plurality of spread spectrum signals. 

19. The method of Qaim 18, 

forming summed partial correlation results for the determined code 
phase search range by adding the partial correlation results for each of the 
plurality of signal segments correlated with a particular replica signal, the partial 
correlation results for each segment added to the partial correlation results for a 
previous signal segment upon completion of partially correlating each signal 
segment. 

20. The method of Qaim 18, determining a different code phase 
search range for at least some of the spread spectrum signals based upon attributes 
identified for the corresponding spread spectrum signals. 

21. The method of Claim 18, partially correlating each signal 
segment with corresponding segments of the plurality of replica signals for the 
code phase search range determined in one chip or half-chip increments based on 
a processor controlled input 
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22. A method for code phase searching a plurality of spread 
spectrum signals having repeating sequences of bits from corresponding signal 
sources, comprising: 

determining a Doppler search range having at least one Doppler Bin 
with a corresponding Doppler frequency for each of a plurality of spread spectrum 
signals received; 

segmenting each Doppler bin into a plurality of Doppler segments; 

forming a plurality of signal segments by segmenting the plurality of 
spread spectrum signals; 

sequentially forming a plurality of product signal segments for each 
of the plurality of signal segments by multiplying each signal segment by 
corresponding Doppler segments for each of the plurality of Doppler bins for each 
of the spread spectrum signals; 

partially correlating each of the product signal segments with 
corresponding segments of a plurality of replica signals for a predetermined 
number of phase delays, each of the plurality of replica signals corresponding to 
one of the plurality of spread spectrum signals. 

23. The method of Claim 22, sequential partially correlating each of 
the product signal segments by partially correlating one product signal segment 
before partially correlating a subsequently formed product signal segment 

24. A method for code phase searching a plurality of spread 
spectrum signals having repeating sequences of bits from corresponding signal 
sources, comprising: 

forming a plurality of signal segments by segmenting a spread 
spectrum signals received; 

forming a plurality of first Doppler signal segments; 

forming a first product signal segment by multipl)dng a first signal 
segment by a first Doppler signal segment; 
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partially correlating the first product signal segment with a 
corresponding segment of a replica signal for a predetermined number of phase 
delays; 

forming a second product signal segment by multiplying the first 
signal segment by a second Doppler signal segment; 

partially correlating the second product signal segment with a 
corresponding segment of a replica signal for a predetermined niunber of phase 
delays after partially correlating the first product signal segment. 

25. The method of Qaim 24, the spread spectrum signal having a 
repeating sequence of pseudorandom code bits modulated with data message bits 
having a data bit time tiiat is an integer number of a repeat time of the 
pseudorandom code bits, 

mxiltiplying a result of partially correlating the first product signal 
segment with a replica data modulation signal segment corresponding in time to 
the partially correlated first product signal segment; 

after multiplying, accumulating partial correlation results for a time 
period exceeding the data bit time. 

26. The method of Qaim 25, preventing the partial correlation of 
imknown data message bits by partially correlating the unknown data message 
bits with corresponding zero magnitude portions of the replica data modulation 
signal. 

27. The method of Claim 24, separating a last signal sample of the 
first Doppler signal segment from a first signal sample of the second Doppler 
signal segment by a single sample. 
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28. A method for searching spread spectrum signals having a 
repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals; 

determining a Doppler search range comprising at least one Doppler 
Search bin for at least one of the pltirality of spread spectrum signals; 

correlating virtually in parallel at least one of the plurality of spread 
spectrum signals over the corresponding Doppler search range for a 
predetermined number of phase delays by sequentially correlating at a rate greater 
than a rate at which the spread spectrum signal is received. 

29. A method for searching a spread spectrum signal having a 
repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals; 

correlating virtually in parallel the plurality of spread spectrum 
signals over a predetermined number of phase delays by sequentially correlating 
the plurality of spread spectrum signals with corresponding replica signals at a 
rate greater than a rate at which the spread spectrum signals are received. 

30. The method of Claim 29, correlating virtually in parallel at least 
one of the pliuality of spread spectrum signals over a corresponding Doppler 
search range by sequentially correlating with Doppler signals within the 
corresponding Doppler search ranges at a rate greater than a rate at which the 
spread spectrum signals are received. 

31. A method for searching a plxirality of spread spectrum signals 
having repeating sequences of bits from corresponding signal sources, comprising: 

receiving a plurality of spread spectrum signals; 
assigning a Doppler search range to each of the spread spectrum 
signals received; 
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generating at least one Doppler search signal for each of the assigned 
Doppler search ranges, the number of Doppler search signals generated for at least 
two of the Doppler search ranges being different. 

32. The method of Qaim 31, generating the Doppler search signals 
totaling in number not more than a predetermined number of Doppler search 
signals. 

33. The method of Claim 31, applying a Doppler step size increment 
between each of the Doppler search signals within a particular Doppler search 
range. 

34. The method of Claim 33, applying different Doppler step size 
increments for different spread spectrum signals 

35. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals at a sample rate; 

forming a plurality of signal segments by segmenting the plurality of 
spread spectrum signals; 

partially correlating the plurality of signal segments with 
corresponding segments of a first replica signal having a first time shift for a 
predetermined number of phase delays, the first replica signal corresponding to 
one of the plurality of spread spectrum signals; 

partially correlating the plurality of signal segments with 
corresponding segments of the first replica signal having a second time shift for a 
predetermined nimiber of phase delays. 
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36. The method of Claim 35, selecting the first and second time shifts 
so that a difference between the first and second time shifts is a fractional multiple 
of a time interval of the sample rate. 

37. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

accumulating a magnitude of results of partially correlating a 
plurality of signal segments with corresponding replica signal segments for a 
predetermined number of phase delays; 

determining which of the accumulated magnitudes is greatest; 

determining whether a subsequent accumulation of the greatest 
accumulated magnitude will result in an accumxilation overflow; 

scaling all subsequent acctimulated magnitudes for all of the 
predetermined phase delays with a common scaling factor if it is determined that 
the subsequent accumulation of the greatest accumulated magnitude will result in 
an accumulation overflow; 

storing the common scaling factor. 

38. The method of Qaim 37, partially correlating each of the plurality 
of sigiial segments with the corresponding replica signal segments for the 
predetermined number of phase delays for a coherent integration time period. 

39. The method of Qaim 37, scaling all subsequent accumulated 
magnitudes for all of the predetermined phase delays with a common scaling 
factor in non-coherent memory. 



40. A method for code phase searching a spread spectrum signal 
having a repeating sequence of bits, comprising: 

receiving a plurality of spread spectrum signals; 
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correlating of the spread spectrum signals with a plurality of replica 
signals for a predetemiined nxunber of phase delays, each of the plurality of 
replica signals correspondiag to one of the plurality of spread spectnim signals; 

scaling at least some results of correlating each of the spread 
spectrum signals by different scale factors. 

41. The method of Qaim 40, preventing accimiulation overflow by 
adjusting the scale factor for each of the plurality of spread spectrum signals. 

42. The method of Qaim 40, scaling at least some results of 
correlating each of the spread spectrum signals by different scale factors in 
coherent memory. 

43. A method for code phase searching a spread spectrtun signal, 

comprising: 

determining a correlation magnitude for a predetermined number of 

phase delays; 

determining which of the correlation magnitudes for the 
predetermined ntimber of phase delay is greatest and second greatest; 

determining a difference of phase delays of the first and second 
greatest correlation magnitudes; 

confirming an approximate phase delay determination 
corresponding to a phase delay of the greatest magnitude by determining whether 
a magnitude of the phase delay difference is equal to one xmit of phase delay. 

44. A spread spectrum signal receiver comprising: 

a signal segmentation circuit having an n-bit signal segment output 
coupled to an input of an n-bit signal segment register; 

a first n-bit replica signal segment register; 
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a multiplier circuit having a signal segment input coupled to an 
output of the n-bit signal segment register, the multiplier circuit having a replica 
signal segment input coupled to an output of the n-bit replica signal segment 
register; 

a correlator having an input coupled to an output of the multiplier 

circuit; 

a coherent integrator having an input coupled to an output of the 

correlator. 

45. The receiver of Qaim 44, a second n-bit replica signal segment 
register, a first multiplexor having an output coupled to the replica signal segment 
input of the multiplier circuit, the outputs of the first and second n-bit replica 
signal segment registers coupled to inputs of the first multiplexor. 

46. The receiver of Qaim 45, first and second Doppler signal 
segment registers, a second multiplexor having first and second inputs coupled to 
corresponding outputs of the first and second Doppler signal segment registers, an 
output of the second multiplexor and an output of the signal segment register 
coupled to a first multiplier circuit, an output of the first multiplier circuit and the 
output of the first multiplexor coupled to inputs of a second multiplier circuit 
having an output coupled to the correlator. 

47. The receiver of Qaim 44, receiver quality clock coupled to the 
signal segmentation circuit and the n-bit signal segment register, a second non- 
receiver quality clock coupled to the first n-bit replica signal segment register, the 
multiplier circuit, the correlator and the coherent integrator. 

48. The receiver of Qaim 44, the receiver being driven by at least 
two clock sources, one of said clock soiurces being of receiver quality clock stability 
coupled to the signal segmentation circuit and the n-bit signal segment register. 
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and another of said clock sources being of non-receiver quality clock stability, said 
first clock source coupled to the first n-bit replica signal segment register, the 
multiplier circuit, the flash correlator and the coherent integrator. 

49. The receiver of Qaim 44, the signal segmentation circuit for 
forming a plurality of signal segments having unequal numbers of samples not 
greater than n-bits. 

50. The receiver of Claim 44, the correlator for correlating at a rate 
greater than a rate at which the spread spectrum signal is received. 
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U SEGMENT LENGTH=32 ^ 

r (IN HAIF-CHIPS) 



csl 

II 

CO 



C9 
UJ 
CO 



INT-DUMP TIME 0FFSET=19 ms 

INT-DUMP TIME 0FFSET=18 ms 

INT-DUMP TIME 0FFSET=17 ms 

INT-DUMP TIME 0FFSET=16 ms 

INT-DUMP TIME 0FFSET=15 ms 
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CODE PHASE 





COHERENT 
INTEGRATION 
> PERIOD 
FOR ALL 
IS 20 ms 



INT-DUMP TIME 0FFSET=4 ms 

INT-DUMP TIME 0FFSET=3 ms 

INT-DUMP TIME 0FFSET=2 ms 

INT-DUMP TIME 0FFSET=1 ms 

INT-DUMP TIME OFFSET=0 ms 
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